Bayesian Model Averaging in Rule Induction

نویسنده

  • Pedro Domingos
چکیده

Bayesian model averaging (BMA) can be seen as the optimal approach to any induction task. It can reduce error by accounting for model uncertainty in a principled way, and its usefulness in several areas has been empirically veri ed. However, few attempts to apply it to rule induction have been made. This paper reports a series of experiments designed to test the utility of BMA in this eld. BMA is applied to combining multiple rule sets learned from di erent subsets of the training data, to combining multiple rules covering a test example, to inducing technical rules for foreign exchange trading, and to inducing conjunctive concepts. In the rst two cases, BMA is observed to produce lower accuracies than the ad hoc methods it is compared with. In the last two cases, BMA is observed to typically produce the same result as simply using the best (maximum-likelihood) rule, even though averaging is performed over all possible rules in the space, the domains are highly noisy, and the samples are mediumto small-sized. In all cases, this is observed to be due to BMA's consistent tendency to assign highly asymmetric weights to di erent models, even when their accuracy di ers by little, with most models (often all but one) e ectively having no in uence on the outcome. Thus the e ective number of models being averaged is much smaller for BMA than for common ad hoc methods, leading to a smaller reduction in variance. This suggests that the success of the multiple models approach to rule induction is primarily due to this variance reduction, and not to its being a closer approximation to the Bayesian ideal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Factors Affecting Tax Evasion in Iran's Economy Based on the Bayesian averaging approach

This study seeks to model tax evasion and identify how effective factors affect tax evasion in the Iranian economy. Recent models show the failure of traditional models; Models do not have enough ability to model hidden variables such as tax evasion. The present study considers this failure in identifying explanatory variables and experimental model design. To achieve this, the Bayesian averagi...

متن کامل

Predicting waste generation using Bayesian model averaging

A prognosis model has been developed for solid waste generation from households in Hoi An City, a famous tourist city in Viet Nam. Waste sampling, followed by a questionnaire survey, was carried out to gather data. The Bayesian model average method was used to identify factors significantly associated with waste generation. Multivariate linear regression analysis was then applied to evaluate th...

متن کامل

Bayesian Integration of Rule Models

Although Bayesian model averaging (BMA) is in principle the optimal method for combining learned models, it has received relatively little attention in the machine learning literature. This article describes an extensive empirical study of the application of BMA to rule induction. BMA is applied to a variety of tasks and compared with more ad hoc alternatives like bagging. In each case, BMA typ...

متن کامل

Factors Affecting Energy Intensity in Provinces of Iran: Bayesian Averaging Approach

The identification of the most important factors affecting energy intensity with the aim of controlling and managing energy consumption is an important topic. Findings of different empirical studies on the factors affecting energy intensity are inconsistent and this raises uncertainty about the employed models. One of the techniques that conform to these uncertainty conditions of the model is t...

متن کامل

Factors Affecting Energy Intensity in Provinces of Iran: Bayesian Averaging Approach

The identification of the most important factors affecting energy intensity with the aim of controlling and managing energy consumption is an important topic. Findings of different empirical studies on the factors affecting energy intensity are inconsistent and this raises uncertainty about the employed models. One of the techniques that conform to these uncertainty conditions of the model is t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997